IN THE CLAIMS

Please amend the claims as follows: 

1. (CURRENTLY AMENDED) In a system comprising perspectives and categories, each
perspective including at least one category representative of that perspective, a computerized 
method for classifying at least one item across multiple perspectives, said computerized method 
comprising: 

associating category features with each category, wherein each of said category features 
represents one of a plurality of tokens; 

producing a category vector for each category, wherein each category vector includes a 
weight corresponding to each category feature, said weight indicative of a degree of association 
between said category feature and said category; 

associating item features with each item, wherein each of said item features represents 
one of a plurality of tokens found in said item; 

producing a feature vector for each item, wherein each feature vector includes said item 
features with a count corresponding to each item feature, said count indicative of the number of 
times said item feature appears in said item; 

multiplying said category vector by said item vector to produce a plurality of category 
scores for each item; and 

for each perspective, across multiple perspectives, classifying an item into a category 
provided said category score exceeds a predetermined threshold. 
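
Illustrative note (not part of the claims): the steps recited in claim 1 can be sketched in code roughly as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation: the whitespace tokenizer, the single shared threshold, and the names `feature_vector`, `category_score`, `classify`, and the sample perspectives are all hypothetical.

```python
from collections import Counter


def feature_vector(text):
    """Count how often each token appears in the item.

    Assumption: tokens are lowercase whitespace-separated words.
    """
    return Counter(text.lower().split())


def category_score(category_vector, item_vector):
    """Multiply category weights by item feature counts and sum (a dot product)."""
    return sum(weight * item_vector.get(feature, 0)
               for feature, weight in category_vector.items())


def classify(item_text, perspectives, threshold):
    """For each perspective, keep the categories whose score exceeds the threshold."""
    item_vector = feature_vector(item_text)
    return {
        perspective: [name for name, vector in categories.items()
                      if category_score(vector, item_vector) > threshold]
        for perspective, categories in perspectives.items()
    }


# Hypothetical data: two perspectives, each with two categories.
perspectives = {
    "topic": {
        "printing": {"printer": 0.9, "toner": 0.7},
        "networking": {"router": 0.8},
    },
    "intent": {
        "troubleshooting": {"broken": 0.6, "error": 0.8},
        "purchase": {"buy": 0.9},
    },
}
labels = classify("printer error printer broken", perspectives, threshold=1.0)
```

Here the item is scored against every category in both perspectives, so a single item can receive one label per perspective.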

2. (CURRENTLY AMENDED) The computerized method of claim 1, wherein 

the count includes at least one subcount indicative of the number of times the item feature 
appears in a particular region of the item, wherein the particular region of the item is a subset of 
the item taken as a whole; and 

for first and second categories, in classifying the item into the second category, 
substituting the at least one subcount for the count of each item feature depends on
whether the item was classified into the first category. 

3. (CURRENTLY AMENDED) The computerized method of claim 1, wherein

for first and second categories, classifying the item into the second category depends at 
least in part on a determination of whether the item was classified into the first category. 

4. (CURRENTLY AMENDED) The computerized method of claim 1, wherein the
classifying further comprises: 

comparing a category score for a first item in a first perspective with a category score for 
said first item in a second perspective; and 

modifying the category score in the first perspective in response to the category score in 
the second perspective. 

5. (Cancelled) 

6. (CURRENTLY AMENDED) The computerized method of claim 4, wherein said first 
perspective is an ancestor of said second perspective. 

7. (Cancelled) 

8. (CURRENTLY AMENDED) The computerized method of claim 4, wherein the 
comparing further comprises: creating an ordered list of category scores for all categories in all 
perspectives; 

identifying an item associated with a highest category score in a first perspective; 

identifying said item in a second perspective; and 

decreasing the category score of said item in said second perspective. 

9-10 (Cancelled) 



11. (CURRENTLY AMENDED) The computerized method of claim 8, further comprising
repeating the identifying and decreasing for every perspective. 

12. (CURRENTLY AMENDED) The computerized method of claim 1, wherein the weight
corresponding to said concept node feature is between -1 and 1. 

13. (CURRENTLY AMENDED) A computerized method for classifying text by associating
at least one of a plurality of features with at least one of a plurality of categories, said 
computerized method comprising at least one of manually or automatically associating at least 
one of said plurality of features to at least a first category, said plurality of features contributing 
to a decision to classify a document or query item into said at least first category. 

14. (CURRENTLY AMENDED) The computerized method of claim 13, further including 
classifying at least one item into said at least one category, provided the item includes a 
predetermined number of said plurality of features associated with said category. 

15. (CURRENTLY AMENDED) The computerized method of claim 13, further comprising 
at least one of manually or automatically associating at least one of a plurality of attributes with 
at least one of said plurality of features, said plurality of attributes contributing to a decision to 
classify an item into said at least one category. 

16. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising:

determining whether said at least one feature was manually associated to said at least first 
category; and 

associating an attribute with said at least one feature that indicates that the feature was
Edited.

17-18 (Cancelled) 



19. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
classifying an item into a category, provided the item does not contain a feature whose 
association with said category has a RejectConcept attribute.

20. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising
using the presence of a feature in an item for classifying the item that contains the feature or a 
morphological variant of the feature into a category, provided the feature contains an attribute 
associated with the category that declares the feature to be morphologically variable. 

21. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item for classifying the item that contains the feature into a 
category, provided the feature contains an attribute associated with the category that declares the 
feature to be morphologically invariant. 

22. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item for avoiding classifying the item that contains a 
morphological variant of a feature into a category, provided the feature contains an attribute 
associated with the category that declares the feature to be morphologically invariant. 

23. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item for classifying the item that contains a feature or a case 
variant of the feature into a category, provided the feature contains an attribute associated with 
the category that declares the feature to be case insensitive. 

24. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising
using the presence of a feature in an item for classifying the item that contains a feature into a 
category, provided the feature contains an attribute associated with the category that declares the 
feature to be case insensitive. 

25. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item in avoiding classifying the item that contains a case 
variant of a feature into a category, provided the feature contains an attribute associated with the 
category that declares the feature to be case invariant. 

26. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising
classifying at least one item into at least one of said categories, provided the item contains a 
feature whose association with said at least one category has a DirectHit attribute. 

27. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using an Overlap attribute of a first feature to determine whether to use the presence of the first 
feature in an item, wherein said first feature overlaps a second feature in the item, in classifying 
the item that contains the first feature into a category. 

28. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item for classifying the item containing an overlapping 
feature into a category, provided the feature contains an attribute associated with the category 
that declares the feature to be overlap insensitive. 

29. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
using the presence of a feature in an item for classifying the item containing a non-overlapping 
feature into a category, provided the feature contains an attribute associated with the category 
that declares the feature to be overlap sensitive. 

30. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising
using the presence of a feature in an item in avoiding classifying the item that contains an 
overlapping feature into a category, provided the feature contains an attribute associated with the 
category that declares the feature to be overlap sensitive. 

31. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising
at least one of manually or automatically assigning a weight to the feature, said weight indicative 
of a degree of association between said item and said category. 



32. (CURRENTLY AMENDED) The computerized method of claim 31, further comprising:
determining whether said weight was manually assigned to said feature; and 
associating an attribute with said feature that indicates whether the weight was
WeightEdited. 

33. (CURRENTLY AMENDED) The computerized method of claim 32, further comprising 
at least one of manually or automatically replacing a value for said weight with another value, 
provided the feature does not contain an attribute associated with the category that
declares the feature to be WeightEdited. 



34. (CURRENTLY AMENDED) The computerized method of claim 33, further comprising 
manually replacing a value for said weight with another value, provided the feature contains an 
attribute associated with the category that declares the feature to be WeightEdited. 



35-37 (Cancelled) 



38. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising: 

determining whether at least one of said plurality of features is a stop word; and 
setting an attribute indicating that said feature is a stop word. 

39. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising: 

at least one of manually or automatically determining a scope of at least one of said 
plurality of features; and 

setting an attribute indicating that said at least one feature is for queries only, or for 
documents only, or for both. 



40. (CURRENTLY AMENDED) The computerized method of claim 15, further comprising 
setting an attribute indicating that said feature is FilteredOut, provided said feature has been 
manually or automatically filtered out of a classification. 

41. (CURRENTLY AMENDED) The computerized method of claim 31, further comprising
multiplying said weight by a scaling parameter, provided the decision to classify the item into 
said category was based on at least one feature automatically associated with the category. 

42. (CURRENTLY AMENDED) The computerized method of claim 41, wherein said scaling
parameter is between 0 and 1. 

43-44 (Cancelled) 

45. (CURRENTLY AMENDED) The computerized method of claim 1, in which the 
multiplying includes using at least one attribute of at least one category vector in determining 
whether to include a document feature of the feature vector in the multiplying. 

46. (CURRENTLY AMENDED) The computerized method of claim 45, in which the using 
at least one attribute includes using at least one of: 

a Stop attribute to indicate whether a feature must constitute something other than a stop 
word to be included in the multiplying; 

a Case attribute to indicate whether a feature must match a letter case specification to be 
included in the multiplying; 

a Stemming attribute to indicate whether a feature includes stemmed word forms to be 
included in the multiplying; and 

a Learned attribute to indicate how a human-specified feature is to be included in the 
multiplying. 

47. (CURRENTLY AMENDED) The computerized method of claim 45, in which the 
perspective to which the category relates determines the value of the at least one attribute. 



48. (CURRENTLY AMENDED) In a system including perspectives and categories, each 
perspective comprising at least one category representative of that perspective, a computerized 
method for constructing a classifier to classify at least one item across multiple perspectives, the
computerized method including: 

associating at least one feature with each category, in which each feature is configured 
for being detected in at least a portion of the at least one item for classification of that item; 

determining an initial weight indicating a degree of association between each associated 
feature and category; and 

in which weights for a category are initially related to weights for other categories of the 
same perspective but are initially substantially unrelated to weights for categories in different 
perspectives. 

49. (CURRENTLY AMENDED) The computerized method of claim 48, in which the 
determining the weight indicating a degree of association between each associated feature and 
category includes using the corresponding feature's distribution in training data items tagged to 
categories from the same perspective as the category being associated with the corresponding 
feature. 

50. (CURRENTLY AMENDED) The computerized method of claim 48, in which the 
determining the weight indicating a degree of association between each associated feature and 
category includes receiving a user input specifying the weight. 

51. (CURRENTLY AMENDED) The computerized method of claim 50, further including
deeming a feature to be unassociated with a category if no user input is received specifying the 
weight corresponding to the feature. 



52. (CURRENTLY AMENDED) The computerized method of claim 48, further including 
deeming a feature to be unassociated with a category if a magnitude of a corresponding weight 
between the feature and the category does not exceed a predetermined threshold value. 

53. (CURRENTLY AMENDED) The computerized method of claim 52, further including
specifying the predetermined threshold value, for each perspective, independent of the 
predetermined threshold value for other perspectives. 

54. (CURRENTLY AMENDED) The computerized method of claim 48, further including 
deeming a feature to be unassociated with a category if a number of features associated with the 
category exceeds a predetermined threshold value. 

55. (CURRENTLY AMENDED) The computerized method of claim 48, further including 
limiting how many categories an item can be classified into within a particular perspective. 

56. (CURRENTLY AMENDED) In a system comprising perspectives and categories, each 
perspective comprising at least one category representative of that perspective, the system also 
comprising weights, each weight indicating a degree of an association between a feature and a 
category, a computerized method for classifying at least one item across multiple perspectives, 
the computerized method comprising: 

identifying feature instances in the items; 

representing, for each item, which features were identified in that item and the number of 
instances each such feature was identified in that item; 

computing, for each item, a category score for each category associated with at least one 
feature identified in that item, the computing using the weight associating the category and the at 
least one feature identified in that item; 

selecting one or more categories to represent each perspective according to the category 
scores; and 

classifying the items across the selected categories representing the multiple perspectives. 



57. (CURRENTLY AMENDED) The computerized method of claim 56, in which the 
selecting includes comparing, for each category, the category score to a predetermined threshold 
value of the perspective represented by the category, including filtering out those categories with 
a category score below the predetermined threshold value, and keeping those remaining
categories with a category score that equals or exceeds the predetermined threshold value. 

58. (CURRENTLY AMENDED) The computerized method of claim 56, further including, 
for each perspective, limiting the number of categories for that perspective to be less than or 
equal to a category count limit for that perspective, and in which the selecting one or more 
categories to represent each perspective includes eliminating categories in excess of the category 
count limit for that perspective based on their relatively lower category scores. 
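
Illustrative note (not part of the claims): the per-perspective threshold filtering of claim 57 and the category count limit of claim 58 could be combined along the following lines. This is a sketch only; the function name `select_categories` and the sample scores are hypothetical, and ties are resolved by the stable sort order, which the claims do not specify.

```python
def select_categories(scores, threshold, count_limit):
    """Select the categories that represent one perspective.

    scores: mapping of category name to category score for one item.
    threshold: the perspective's predetermined threshold value.
    count_limit: maximum number of categories kept for the perspective.
    """
    # Keep categories whose score equals or exceeds the threshold.
    kept = [(name, score) for name, score in scores.items() if score >= threshold]
    # Rank by score so that the excess categories dropped are the lowest scoring.
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in kept[:count_limit]]


# Hypothetical scores for one item within a single perspective.
selected = select_categories(
    {"a": 0.9, "b": 0.4, "c": 0.7, "d": 0.1}, threshold=0.3, count_limit=2
)
```

With these sample values, category "d" falls below the threshold and "b" is eliminated as the lowest-scoring category in excess of the limit.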

59. (CURRENTLY AMENDED) The computerized method of claim 56, in which the items 
include representations of documents or queries. 

60. (CURRENTLY AMENDED) The computerized method of claim 56, in which the 
identifying features includes identifying vocabulary relevant to at least one category. 

61. (CURRENTLY AMENDED) The computerized method of claim 56, in which the
identifying feature instances in the items includes identifying feature instances in documents,
and further including using an attribute designating at least one region of the document to which 
the identifying feature instances is limited. 

62. (CURRENTLY AMENDED) The computerized method of claim 56, further including 
receiving user input for determining at least one weight. 

63. (CURRENTLY AMENDED) The computerized method of claim 56, further including 
statistically determining at least one weight using training data. 

64. (CURRENTLY AMENDED) The computerized method of claim 56, further
including providing, for each feature, at least one weight associating that feature with a 
corresponding category, and basing the at least one weight on at least one of: 

automated processing of training data; 
user-input data; and

a combination of automated processing of training data and user-input data. 

65. (CURRENTLY AMENDED) The computerized method of claim 64, further including 
specifying, for each perspective, a degree for combining automated processing of training data 
and human-input data for weights associated with categories representative of that perspective. 

66. (CURRENTLY AMENDED) The computerized method of claim 56, further including 
modifying weights initially associating features with one or more categories representing a first 
perspective based on other weights associating features with one or more other categories 
representing one or more perspectives different from the first perspective. 

67. (CURRENTLY AMENDED) The computerized method of claim 66, in which the 
modifying weights includes, if a feature's initial weights indicate that the feature is strongly
correlated with at least one category in a first perspective and weakly correlated to the categories 
of different perspectives, then doing at least one of: 

reducing the feature's weights to the categories of the different perspectives; and 
increasing the feature's weight to the category of the first perspective. 

68. (CURRENTLY AMENDED) The computerized method of claim 56, further including 
incorporating a dependence between an item's category score for a category representing a first 
perspective and the item's category score for one or more other categories representing one or 
more perspectives different from the first perspective. 

69. (CURRENTLY AMENDED) The computerized method of claim 68, in which the
incorporating the dependence includes, if the item's category score for a category representing a 
first perspective equals or exceeds a threshold value, then inhibiting classification of the item to 
one or more other categories representing one or more perspectives different from the first 
perspective. 

70. (CURRENTLY AMENDED) The computerized method of claim 68, in which the
incorporating the dependence includes, if the item's category score for a category representing a 
first perspective equals or exceeds a threshold value, then reducing the item's category score for 
one or more other categories representing one or more perspectives different from the first 
perspective. 
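
Illustrative note (not part of the claims): the cross-perspective dependence of claims 69 and 70 might be sketched as below. The claims do not say how inhibition or reduction is carried out, so zeroing the score for "inhibit" and multiplying by a fixed `factor` for "reduce" are assumptions, as are the function name `apply_dependence` and the sample keys.

```python
def apply_dependence(scores, first, threshold, others, mode="reduce", factor=0.5):
    """Adjust an item's scores in other perspectives based on a first perspective.

    scores: mapping of (perspective, category) to the item's category score.
    first: the (perspective, category) key that triggers the dependence.
    others: keys of categories in different perspectives to adjust.
    mode: "inhibit" zeroes the score (assumed reading of claim 69);
          "reduce" scales it by `factor` (assumed reading of claim 70).
    """
    adjusted = dict(scores)
    # The dependence applies only when the first score equals or exceeds the threshold.
    if scores[first] >= threshold:
        for key in others:
            adjusted[key] = 0.0 if mode == "inhibit" else scores[key] * factor
    return adjusted


# Hypothetical scores for one item in two perspectives.
scores = {("topic", "printing"): 2.0, ("intent", "purchase"): 1.0}
reduced = apply_dependence(scores, ("topic", "printing"), 1.5,
                           [("intent", "purchase")], mode="reduce")
inhibited = apply_dependence(scores, ("topic", "printing"), 1.5,
                             [("intent", "purchase")], mode="inhibit")
```

The original score mapping is left unmodified, so the unadjusted scores remain available for comparison.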

71. (CURRENTLY AMENDED) The computerized method of claim 56, in which the items 
are documents, and in which the classifying the items includes limiting the regions of the 
document that are used for classifying the documents to at least one category in a second 
perspective based on a characteristic of a classification of the document to at least one category 
in the first perspective. 

72. (CURRENTLY AMENDED) In a system for classifying items to categories, a 
computerized method including: 

receiving user-input defining all associations between classification features and 
categories; and 

statistically determining weights corresponding to the user-defined associations, each 
weight indicating a degree to which the association's feature identifies the association's category 
and discriminates against other categories. 

73. (CURRENTLY AMENDED) The computerized method of claim 72, further including 
identifying candidate features in documents or queries, in which the candidate features include 
words or phrases in the documents or queries. 

74. (CURRENTLY AMENDED) The computerized method of claim 72, further including 
providing attributes for the features. 

75. (CURRENTLY AMENDED) The computerized method of claim 74, in which the
providing attributes includes providing at least one of: 

an Exact Match attribute to indicate whether a match to the feature requires both
matching case and matching a stemming form; 

a Case attribute to indicate whether a match to the feature requires matching case; 

a Stemming attribute to indicate whether a match to the feature requires matching a 
stemming form; and 

an EmbeddedTermsAllowed attribute to indicate whether a match to a first feature 
precludes a match to one or more other features embedded within the first feature. 

76. (CURRENTLY AMENDED) The computerized method of claim 72, further including 
providing attributes for the associations of features to categories. 

77. (CURRENTLY AMENDED) The computerized method of claim 76, in which providing 
attributes includes providing at least one of: 

an Edited attribute indicating whether the association has been specified in a Recorded 
Evidence Edits table; 

a WeightEdited attribute indicating whether a weight of the association was specified or 
edited by a human user; 

a Stop attribute indicating whether the feature is a stop word; 

a Scope of Feature attribute indicating whether the association of the feature to the 
category applies to topic spotting of queries only, topic spotting of documents only, or topic 
spotting of both queries and documents; 

a FilteredOut attribute indicating whether the feature should be disregarded during topic 
spotting; 

a DirectHit attribute that, if asserted, indicates that a document or query including the 
feature should be tagged to the category specified in the association between feature and 
category bearing the DirectHit attribute, regardless of what other features are included in the 
document or query; 

a RejectConcept attribute that, if asserted, indicates that a document or query including 
the feature should not be tagged to the category specified in the association between the feature 
and category bearing the RejectConcept attribute, regardless of what other features are included
in the document or query; 

a Case attribute to indicate whether a match to the feature, in a
document or query, requires matching case of the feature; 

a Stemming attribute to indicate whether a match to the feature, in a document or query, 
requires matching a stemming form; and 

an EmbeddedTermsAllowed attribute to indicate whether a match to a first feature, in a 
document or query, precludes a match to one or more other features embedded within the first 
feature, in the document or query. 

78. (CURRENTLY AMENDED) The computerized method of claim 72, further comprising 
receiving user input for overriding at least one of the computed weights indicating a strength of
the association of a feature to a category. 

79. (CURRENTLY AMENDED) The computerized method of claim 78, in which the 
overriding at least one of the computed weights includes increasing the weight's strength of 
association of a feature and a category. 

80. (CURRENTLY AMENDED) The computerized method of claim 72, in which the 
statistically determining weights is based at least in part on how often the associations' features 
are present in a set of training items. 

81. (CURRENTLY AMENDED) The computerized method of claim 72, in which the
determining the weight indicating a degree of association between each associated feature and 
category includes receiving a user input specifying the weight. 



82. (CURRENTLY AMENDED) The computerized method of claim 81, further including 
deeming a feature to be unassociated with a category if no user input is received specifying the 
weight corresponding to the feature. 

83. (CURRENTLY AMENDED) The computerized method of claim 72, further including
receiving user input for determining at least one weight. 

84. (CURRENTLY AMENDED) The computerized method of claim 72, further including 
statistically determining at least one weight using training data. 

85. (CURRENTLY AMENDED) The computerized method of claim 84, in which the 
training data consists of at least one document associated with a category.

86. (CURRENTLY AMENDED) The computerized method of claim 84, in which the 
training data consists of at least one user-specified association between a feature and a
category. 

87. (CURRENTLY AMENDED) The computerized method of claim 72, further
including providing, for each feature, at least one weight associating that feature with a 
corresponding category, and basing the at least one weight on at least one of: 

automated processing of training data; 
user-input data; and 

a combination of automated processing of training data and user-input data. 

88. (CURRENTLY AMENDED) The computerized method of claim 87, further including 
specifying, for each perspective, a degree for combining automated processing of training data 
and human-input data for weights associated with categories representative of that perspective. 

89. (CURRENTLY AMENDED) In a system for classifying items to categories, a 
computerized method including: 

receiving user-input creating user-defined associations between classification features 
and categories; 

statistically determining machine-defined associations that are capable of being different 
from the user-defined associations; and 
classifying items to the categories using weights corresponding to the user-defined
associations and the machine-defined associations. 

90. (CURRENTLY AMENDED) The computerized method of claim 89, further including 
controlling relative contributions of the weights corresponding to the user-defined associations 
and the weights corresponding to the machine-defined associations. 

91. (CURRENTLY AMENDED) The computerized method of claim 90, in which the
controlling includes obtaining a greater relative contribution of the user-defined
associations with respect to the machine-defined associations. 
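
Illustrative note (not part of the claims): one way to control the relative contributions recited in claims 90 and 91 is a linear blend of the two weight sets. The claims do not prescribe a formula, so the linear blend, the `user_share` parameter, and the function name `blend_weights` are assumptions made for this sketch.

```python
def blend_weights(user_weights, machine_weights, user_share=0.75):
    """Combine user-defined and machine-defined feature weights for one category.

    user_share: fraction of the result contributed by user-defined weights.
    A value above 0.5 gives the user-defined associations the greater
    relative contribution. A feature missing from one side contributes 0.0.
    """
    features = set(user_weights) | set(machine_weights)
    return {
        feature: user_share * user_weights.get(feature, 0.0)
        + (1.0 - user_share) * machine_weights.get(feature, 0.0)
        for feature in features
    }


# Hypothetical weights for a single category.
combined = blend_weights({"toner": 1.0}, {"toner": 0.2, "ink": 0.4})
```

Setting `user_share` per perspective would allow the degree of combination to differ from one perspective to another.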



92. (CURRENTLY AMENDED) The computerized method of claim 89, in which the 
classifying includes classifying the items to categories spanning multiple perspectives. 



